Add MoE prepare input kernels #29
Conversation
- restructure moe kernels folder
- add prepare moe inputs kernel
Signed-off-by: kareem <[email protected]>
Signed-off-by: kareem <[email protected]>
Signed-off-by: Shaik, Kareem M <[email protected]>
adityachatter
left a comment
LGTM.
msinnha1
left a comment
One general comment: please also add error handling to the functions.
For example, in void prepare_moe_input():
TORCH_CHECK(topk_ids.dtype() == torch::kInt32, "topk_ids must be int32");
...
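The kind of argument validation suggested above could be sketched as follows. This is a minimal, self-contained illustration, not the PR's actual code: `SKETCH_CHECK` is a stand-in for `TORCH_CHECK` so the example compiles without libtorch, and `TensorDesc`, `DType`, `validate_prepare_moe_input`, and `validation_error` are hypothetical names. Only the int32 check comes from the review comment; the rank, contiguity, and `num_experts` checks are assumptions about what else such a function might want to verify.

```cpp
#include <cassert>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Stand-in for TORCH_CHECK (assumption: used here only so the sketch builds
// without libtorch). Throws std::invalid_argument when the condition fails.
#define SKETCH_CHECK(cond, msg)                      \
  do {                                               \
    if (!(cond)) throw std::invalid_argument(msg);   \
  } while (0)

// Hypothetical, simplified tensor description (not a real torch type).
enum class DType { Int32, Int64, Float32 };

struct TensorDesc {
  DType dtype;
  std::vector<int64_t> shape;
  bool contiguous;
};

// Hypothetical validation front-end for a prepare_moe_input-style entry point.
// Checks run before any kernel launch so bad inputs fail with a clear message.
void validate_prepare_moe_input(const TensorDesc& topk_ids, int64_t num_experts) {
  // From the review comment: the routing indices must be int32.
  SKETCH_CHECK(topk_ids.dtype == DType::Int32, "topk_ids must be int32");
  // Assumed extra checks, for illustration only.
  SKETCH_CHECK(topk_ids.shape.size() == 2, "topk_ids must be 2-D [num_tokens, topk]");
  SKETCH_CHECK(topk_ids.contiguous, "topk_ids must be contiguous");
  SKETCH_CHECK(num_experts > 0, "num_experts must be positive");
}

// Helper for callers and tests: returns the first failing check's message,
// or an empty string when every check passes.
std::string validation_error(const TensorDesc& topk_ids, int64_t num_experts) {
  try {
    validate_prepare_moe_input(topk_ids, num_experts);
    return "";
  } catch (const std::invalid_argument& e) {
    return e.what();
  }
}
```

In the real extension the same checks would presumably be written directly with `TORCH_CHECK` against the `torch::Tensor` arguments, which surfaces the message as a Python exception instead of failing inside the kernel.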
Signed-off-by: Shaik, Kareem M <[email protected]>
@kareemshaik80 please rebase with the latest main
airMeng
left a comment
Generally follows the SGLang CUDA kernel. LGTM except for some minor comments.
mingfeima
left a comment
Good job on this one!
Just a few minor places to change, and it should be fine.
@kareemshaik80 could you please also collect how much is the ratio of
Signed-off-by: Shaik, Kareem M <[email protected]>
airMeng
left a comment
Please add the new UT into https://github.com/sgl-project/sgl-kernel-xpu/blob/main/tests/run_suite.py
I forgot it too :)
Signed-off-by: Shaik, Kareem M <[email protected]>
@kareemshaik80 good work here! Please continue to analyze how much overhead the shuffle takes; you can share the data internally.
…project#57)" This reverts commit eb9cfca.
* Revert "Revert "Add MoE prepare input kernels (#29)" (#57)" This reverts commit eb9cfca. Signed-off-by: Shaik, Kareem M <[email protected]>
* Restructure MoE and add prepare inputs/meta kernel
  - restructure moe kernels folder
  - add prepare moe inputs kernel
  Signed-off-by: kareem <[email protected]>
* fix minor issues
  Signed-off-by: kareem <[email protected]>
* Add tests
  Signed-off-by: kareem <[email protected]>
* Add shuffle_rows Kernel
  Signed-off-by: kareem <[email protected]>
* register shuffle_rows
  Signed-off-by: kareem <[email protected]>
* Enable Build and Add apply_shuffle_mul_sum kernel
  Signed-off-by: kareem <[email protected]>
* functional
  Signed-off-by: Shaik, Kareem M <[email protected]>
* cleanup
  Signed-off-by: kareem <[email protected]>
* cleanup1
  Signed-off-by: kareem <[email protected]>
* Modify fused expert to invoke moe_kernels and increase test coverage
  Signed-off-by: Shaik, Kareem M <[email protected]>
* Cleanup makefile
  Signed-off-by: Shaik, Kareem M <[email protected]>
* remove debug code
  Signed-off-by: Shaik, Kareem M <[email protected]>
* fix lint
  Signed-off-by: Shaik, Kareem M <[email protected]>
* Fix review comments
  Signed-off-by: Shaik, Kareem M <[email protected]>
* Add to CI
  Signed-off-by: Shaik, Kareem M <[email protected]>
---------
Signed-off-by: kareem <[email protected]>
Signed-off-by: Shaik, Kareem M <[email protected]>
This reverts commit ac9e2a7.
* Revert "Revert "Add MoE prepare input kernels (sgl-project#29)" (sgl-project#57)" This reverts commit eb9cfca. Signed-off-by: Shaik, Kareem M <[email protected]>